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Reply to Office Action of April 20, 2004 

Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims m the 

application: 
Testing of Claims: 

Please amend claims 1, 21 and 23 as follows: 

1. (cvurently amended): A method of modeling phenomena comprising the steps of: 
analyzing one or more instances of actual phenomena to identify characteristics of the 

instances of the actual phenomena; 

creating a set of taps defining the identified characteristics of t he one or more instances Qf 
the actual phenomena, each tag controlling one or more aspects of one or more modeled 
phenomena to he produced in response to the tags, the tags co ntrolling the aspects of the modeled 
phenomena so as to create characteristics in the modeled phenomena similar to those exhibite d 
by the one or more instances of the actual phenomena : 

arranging selected members of the set of tags in a desired sequence to produce 
phenomena as defined by the sequence of tags; and 

processing the tags in order to produce phenomena having the characteristics defined by 

the tags. 

2. (original): The method of claim 1 wherem the phenomena controlled by the 
tags are characteristics of speech, wherem the step of arranging selected members of the tags in a 
desired sequence comprises placing the selected members of the set of tags into a body of text 
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and wherein the step of processing the tags comprises processing the body of text and the tags to 
produce speech having characteristics defined by the tags. 

3. (original): The method of claim 2 wherein the characteristics of speech are 
prosodic characteristics of speech. 

4. (original): The method of claim 3 wherein each tag imposes a constraint on 
the prosodic characteristics of speech affected by the tag. 

5. (original): The method of claim 4 wherein each of the tags specifies an action 
to be taken and includes parameters defining attributes and associated values providing 
information about the action to be taken. 

6. (original): The method of claim 5 wherein each of the tags may include a 
parameter specifying the location at which the tag takes effect. 

7. (original): The method of claim 6 wherein the set of tags includes tags which 
establish settings which remain unchanged until altered by a subsequent tag. 

8. (original): TThe method of claim 7 wherein the set of tags includes members 
which define the pitch behavior of speech over the course of a phrase. 

9. (original): The method of claim 8 wherein the set of tags includes tags 
defining accents which define the pitch behavior of local influences within a phrase. 

1 0. (original): The method of claim 6 wherein the set of tags includes tags 
defining phrase boundaries which mark boundaries between regions at which tags have effect. 
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11. (original): The method of claim 1 0 wherein a tag which defines a phrase 
boundary prevents tags following the tag which marks the boundary fix)m influencing speech 
components preceding the tag which marks the boundary. 

12. (original): The method of claim 9 wherein each of the tags may include values 
defining type and strength in order to define interaction of the tag with other tags. 

13. (original): The method of claim 12 wherein a tag may compromise its shape, 
average pitch or both depending on the value defining type. 

14. (original): The method of claim 8 wherein the step of processing the tags 
includes establishing a phrase curve by creating and solving equations defined by tags which 
specify changes in pitch and tags which specify rates of changes in pitch. 

15. (original): The method of claim 14 wherein the body of text and the tags are 
processed one minor phrase at a time. 

16. (original): The method of claim 15 wherein processing of a phrase includes 
using values describing properties prevailing near the end of an immediately preceding phrase. 

1 7. (original): The method of claim 9 wherein the step of processing the tags 
includes establishing a pitch curve by creating and solving equations defined by tags which 
specify accents. 

18. (original): The method of claim 17 wherein the body of text and the tags are 
processed one minor phrase at a time. 

1 9. (original): The method of claim 1 8 wherein processing of a phrase includes 
using values describing properties prevailing near the end of an immediately preceding phrase, 
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20. (original): A method of processing a body of text including tags defining 
prosodic characteristics of speech to be produced by processing the text, comprising the steps of: 

extracting the tags from the text; 
creating a set of equations defining a phrase curve; 
solving the set of equations to produce the phrase curve; 
creating a set of equations defining a pitch curve; 
solving the set of equations to produce the pitch curve; 

mapping linguistic concepts represented by the phrase curve and the pitch curve to 
acoustical observables; and 

performing a nonlinear transformation to adjust the prosodic characteristics defined by 
tags to human perceptions and expectations. 

2 1 . (currently amended) : A method of defining a set of tags specifying prosodic 
characteristics of speech of a target speaker, comprising the steps of: 

selecting a body of training text; 

receiving speech representing reading of the training text by the target speaker to fonii a 
training corpus, the training corpus representing actual sounds produced by the reading of the 
training text by the target speaker and exhibiting prosodic characteristics of actual speech of the 
target speaker : 

analyzing the training corpus to identify prosodic characteristics of the training corpus; 

and 

creating a set of tags defining the identified prosodic characteristics of the training corpus. 
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22. (original): A method of placing tags in text for text to speech processing 

comprising the steps of: 

placing tags in a body of training text to model prosodic characteristics of a training 

corpus produced by reading of the training text; 

analyzing the placOTient of the tags in the training text to develop a set of rules for 
placement of tags in text; and 

applying the rules to text for which text to speech processing is desired to place tags in 
the text in order to produce speech having desired prosodic characteristics. 

23- (currently amended): A text to speech system for receiving text inputs 
comprising text to be processed to generate speech and tags defining prosodic characteristics of 
the speech to be generated, comprising: 

a prosody tag generation component to analyze a trainin g corpus to identify 
characteristics exhibited bv one or more readings of text by o ne or more target speakers and to 
generate a set of tags defining the identijSed characteristics: 

a text input interface for receiving the text input; 

a speech modeler operative to process the text inputs to produce speech havmg the 
prosodic characteristics specified by the tags, such that the speech produced by the speech 
modeler is similar to that of the one or more target speakers ; and 

a speech output interface for producing the speech output. 

24. (original): The system of claim 23 wherein the speech modeler is further 
operative to process a training corpus representing a reading of text by a target speaker to 
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produce tags defining prosodic characteristics of the training corpus and use the tags to produce 
speech having prosodic characteristics typical of the target speaker. 

25. (canceled) 

26. (canceled) 

27. (original): The method of claim 2 wherein each tag imposes a constraint on 
motion of an articulator used to produce speech. 

28. (original): The method of claim 1 wherein each tag imposes a constraint on 
modeled muscular motions used to simulate gestures or facial expression. 

29. (canceled) 

30. (canceled) 

3 1 . (original): The method of claim 9 wherein one or more tags are placed within 
a proper noun comprising two or more words, each such tag producing prosody indicating to a 
listener that the proper noun is to be interpreted as a single entity rather than as more than one 
entity. 

32. (original) : The method of claim 3 1 wherem the tag produces an increase in 
the pitch and speed of speech over the speech affected by the tag. 

33. (original): The method of claim 9 wherein one or more tags are placed to 
produce a word having prosody indicating that the word requh^ confinnation, 

34. (original): The method of claim 33 wherein the prosody indicating that the 
word requires confirmation is characterized by a relatively high and increasing pitch across the 
word requiring confirmation. 
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